Collective Sequential Pattern Mining in Distributed Evolving Data Streams
Abstract
Advances in processing and communication techniques have resulted in a multitude of emerging applications that interact with streams of data. Traditional data mining systems store arriving data, collect them for later mining, and make multiple passes over the collected data. Unfortunately, these systems are prohibitively slow when they deal with data streams in which massive amounts of data arrive at high rates. This paper introduces a new model for mining sequential patterns in distributed data stream environments. It focuses on evolving data streams that originate from multiple distributed sources. Moreover, the mining process is achieved without compromising the privacy of the individual data streams of the participating nodes. Simulation results show that the proposed model scales linearly with the number of distributed nodes. In addition, it reduces the overhead in the distributed mining process.
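The contrast between multi-pass mining and stream mining can be illustrated with a minimal sketch. It is not the paper's model: it assumes, for illustration only, that each node counts ordered item pairs in a single pass over its own stream using a small bounded window, and that a coordinator merges only those aggregate counts. The names `local_pair_counts`, `merge_counts`, `window`, and `min_support` are hypothetical, and sharing counts instead of raw events is a simplification, not a privacy guarantee.

```python
from collections import Counter, deque


def local_pair_counts(stream, window=3):
    """Single pass over one node's stream.

    Counts ordered pairs (a, b) where b arrives while a is still among the
    last `window` items. Only the window is kept in memory, never the stream.
    """
    recent = deque(maxlen=window)  # bounded memory: oldest items fall out
    counts = Counter()
    for item in stream:
        # Each distinct earlier item in the window forms a pattern a -> item.
        for earlier in set(recent):
            counts[(earlier, item)] += 1
        recent.append(item)
    return counts


def merge_counts(per_node_counts, min_support):
    """Coordinator step: sum the per-node counts and keep frequent pairs.

    Only aggregate counts are exchanged here (an illustrative assumption).
    """
    total = Counter()
    for counts in per_node_counts:
        total.update(counts)  # Counter.update adds counts together
    return {pair: n for pair, n in total.items() if n >= min_support}


# Two hypothetical nodes, each observing its own event stream.
node_a = ["login", "search", "buy"]
node_b = ["login", "search", "logout"]

frequent = merge_counts(
    [local_pair_counts(node_a, window=2), local_pair_counts(node_b, window=2)],
    min_support=2,
)
print(frequent)  # {('login', 'search'): 2}
```

In this sketch the work per node does not depend on the other nodes, and the coordinator's cost grows with the number of count tables it receives, which is the kind of behavior the abstract's scaling claim refers to; the sketch itself makes no claim about the paper's measured results.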
Similar resources
Look Over on Mining Sequential Patterns in Evolving Data Stream
A data stream is a sequence of digitally encoded coherent signals (packets of data, or data packets) used to send or receive information that is in the process of being transmitted. It is a continuous, rapid, and time-varying stream of data elements. A growing number of applications generate streams of data. Such continuous generation of new elements in a data stream adds on additional constr...
Incremental Mining of Across-streams Sequential Patterns in Multiple Data Streams
Sequential pattern mining is the mining of data sequences for frequent sequential patterns with time sequence, which has a wide application. Data streams are streams of data that arrive at high speed. Due to the limitation of memory capacity and the need of real-time mining, the results of mining need to be updated in real time. Multiple data streams are the simultaneous arrival of a plurality ...
Sequential Pattern Mining of Multimodal Streams in the Humanities
Research in the humanities is increasingly attracted by data mining and data management techniques in order to efficiently deal with complex scientific corpora. Particularly, the exploration of hidden patterns within different types of data streams arising from psycholinguistic experiments is of growing interest in the area of translation process research. In order to support psycholinguistic e...
Sequential Pattern Mining for Uncertain Data Streams using Sequential Sketch
Uncertainty is inherent in data streams and presents new challenges to data stream mining. Because data streams arrive continuously and are large, modeling sequences of uncertain time series data streams requires significantly more space. Therefore, it is important to construct a compressed representation for storing uncertain time series data. Based on granules, sequential sketches are created t...
Predicting Sequential Pattern Changes in Data Streams
Data streams are utilized in an increasing number of real-time information technology applications. Unlike traditional datasets, data streams are temporally ordered, fast changing and massive. Due to their tremendous volume, performing multiple scans of the entire data stream is impractical. Thus, traditional sequential pattern mining algorithms cannot be applied. Accordingly, the present study...